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Abstract 

The properties of the amplitude method for B s oscillation analyses are studied in 
detail. The world combination of measured amplitudes is converted into a likelihood 
profile as a function of oscillation frequency. A procedure is proposed to estimate 
the probability that the minimum observed is due to a statistical fluctuation. This 
method, applied to the data available at the time of 1999 Winter Conferences, gives 
1 - C.L. ps 0.03. 
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1 Introduction 



The first direct search for B s — B s oscillations was published by ALEPH in 1994 |L|]. Since 
then, many sophisticated analyses have been developed by the LEP experiments, SLD and 
CDF 0. None of these analyses has yet been able to measure the oscillation frequency, 
but can exclude a range for the mass difference between the two B s mass eigenstates, 
Am s , which drives the oscillation. 

In order to combine the information provided by the different analyses in the absence 
of a measurement of Am s , a new technique ||, known as the amplitude method, was pro- 
posed. The fit to the reconstructed proper time distribution of events tagged as mixed 
or unmixed is performed with a fixed frequency uj of the oscillating term, while its am- 
plitude A is left as the free parameter. A scan in uj is performed and at each value the 
amplitude is measured. Averaging values from different analyses is straightforward. The 
expected value of the amplitude is unity when uj = Am s (throughout this paper uj stands 
for the frequency folded in the fitting function, Am or Am s indicate the frequency of the 
oscillations in the sample analysed). 

The range of uj for which the amplitude is found to be compatible with zero and 
incompatible with unity can be excluded. An analysis (or a combination of analyses) has 
sensitivity in a given range of uj if the expected error on the measured amplitudes is small 
enough compared to unity, so that the two values ^4 = 0. and A = 1. can be distinguished. 
In order to quote a sensitivity limit, it is normally chosen to determine the value of uj for 
which a measured value ^4 = 0. implies that A — 1. is excluded at 95% CL. This happens 
when 1.645 x 04 = 1. 

The worldwide combination, at the time of the 1999 Winter Conferences, of amplitude 
analyses shows a deviation from A = at uj ~ 15 ps -1 , close to the sensitivity limit of 
14.3 ps -1 @], which could suggest the presence of an oscillating signal. The combination 
includes many preliminary analyses, and some new analyses are still expected from SLD 
and the LEP experiments. The significance of the structure observed might therefore be 
reduced or further enhanced in the near future as ongoing analyses are completed and 
published; in any case a procedure to quantify the probability that the observed structure 
corresponds to a B s oscillation signal is needed. A method is proposed here, based on the 
generation of toy experiments designed to be equivalent to the world combination of Am s 
analyses. 

The paper is organized as follows. 

In Section § the combined amplitude at different values of uj is presented. The com- 
bined likelihood profile as a function of uj is extracted from the amplitude spectrum. 

In Section |^ the properties of the amplitude method are investigated. Analytical ex- 
pressions for the expected shape of the measured amplitude and its error are derived. The 
small and large frequency limits are discussed, proposing an approximate interpretation 
in terms of Fourier transformations. The probability of observing statistical fluctuations 
which would fake a signal in a sample with frequency far beyond the sensitivity is also 
discussed. 
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In Section f| the structure and the features of the toy experiment generator used 
throughout the paper are described. A procedure to tune the parameters of the simulation 
in order to reproduce the observed errors is given. 

In Section [5] a procedure to extract a confidence level value from the likelihood function 
is presented and discussed. The uncertainty arising from the lack of a detailed simulation 
is investigated. 



2 The experimental results 

The combined amplitude measurements obtained from published and preliminary analyses 
available at the time of the 1999 Winter Conferences j|] is presented in Fig. |l|. 

The observed limit is significantly smaller than the expected limit (i.e., that which 
would be obtained if Am s were infinitely large), due to positive amplitude values measured 
in the region close to the sensitivity limit. 

As mentioned in the introduction, the amplitude measurements are obtained by max- 
imizing the likelihood L of the proper time distributions of mixed and unmixed events 
with the amplitude of the oscillating term A as the free parameter, and uj fixed at a 
chosen value. Denote £ = — logL, its expansion at second order around the minimum of 
C, C W (A) can be approximated by 

c u (A) - £*(A) + U— , (i) 



2\ a A 

where A is the measured value of the amplitude, and a A is the uncertainty on A. This 
approximation turns out to be very accurate in reality, C W (A) being parabolic in a wide 
range around A. 

From Eq. [I] it follows that, again for each value of uj: 

C^(A = l) ~ CM) + \{^f) • (2) 

The oscillation vanishes for A = on the one hand, and it averages to zero for uj — > oo 
due to finite resolution on the other. Therefore, the following equality can be written 

A^oc(any.4) = £ any U (A = 0) (=£«,), 

and therefore from Eq. p], 

£oc =^ + \{^j ■ ( 3 ) 
If Eq. [| is subtracted from Eq. ||], the following formula is obtained 

A£(uj) = C W (A = 1) - C ' ' ' ~ A 




(4) 
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Figure 1: Current combined amplitude measurements as a function of u, from the B 
Oscillation Working Group. 

which allows the value of AC to be calculated, for each u, from the fitted amplitude and 
its uncertainty. This formula was already given in Ref. ||. 

In the derivation which follows, the total uncertainties on the measured amplitudes 
are used, which is in principle not rigorous, since Eq. |l] is valid only for the statistical 
part. This fact is not considered to be a problem, since most of the uncertainties quoted as 
systematic errors in the analyses combined are of statistical nature. Typically, for inclusive 
analyses, the largest sources of uncertainty on the amplitude measurement are the fraction 
/b s of B s produced in the b quark hadronization, the b hadron lifetimes and their mean 
energy. All these parameters are measured, and their experimental errors are propagated 
as systematic uncertainties on A. In some analyses the parameters considered as sources of 
systematic errors were even added as free parameters in the fit, with Gaussian constraints 
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coming from external measurements. The part of the error identified as systematic was 
thus found out a posteriori by running the minimization with those parameters fixed. In 
addition, in the region of interest (a? > 10 ps^ 1 ) the statistical errors are dominant. The 
approach adopted should therefore be adequate. 

The likelihood difference A£(u) obtained for the data is shown in Fig. |2|. A good 
parametrization for the shape of AC is obtained with a function f(u>) oc l/uj a with 
a = 1.64, plus some Gaussian functions to describe the deviations. A parabolic fit of 
the three lowest points of the plot gives a minimum for u = 14.8 ps -1 , with a value 
2.9. The A£ min + 1/2 and A£ min + 2 levels are crossed at (14.3 - 15.3) ps 1 
and (13.0 — 17.5) ps -1 values, respectively, giving: 

u = U.8 ±0.5 ps" 1 (A£ min + l/2 interval), 
uj = 14.8 tl'l P s_1 (A£ min + 2 interval) . 

These intervals would give the ±1 o and ±2<j uncertainties if the likelihood profile were 
parabolic in a range wide enough around the minimum. The A£ m i n + 9/2 level is crossed 
on the lower side only, at to = 12.1 ps -1 . 

As discussed in the following sections, the significance of this minimum cannot be 
extracted in an analytical way, but needs to be determined with toy experiments. 



3 The amplitude analysis 

The true proper time distribution of mixed and unmixed B meson decays is written as 
follows: 

<P° (t ) TpW TV ^ I**™ Am t _ E°{t ) ± f° Am (t ) 
I u ,mW - 1 exp (— 1 1 ) = , 

where f Am {to) contains the oscillation term. The plus (minus) sign holds for unmixed 
(mixed) events. Any difference in the decay widths of the two mass eigenstates has been 
neglected. 

The reconstructed proper time distributions can then be written as: 

Jo z z 

For the sake of simplicity, no time dependent selection efficiency has been considered 
in the calculations. In what follows, and throughout this paper, it is assumed that the 
relative uncertainty on the b hadron momentum, and the absolute uncertainty on the 
decay length are Gaussian. This approximation follows what typically happens in real 
analyses, where the uncertainty on the reconstructed b hadron momentum is found to 
roughly scale with the momentum itself, while the uncertainty on the decay length does 
not. This has important consequences in the way the two resolution components affect 
the amplitude shape. 
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Figure 2: Likelihood as a function of uj derived from the combined amplitude measure- 
ments. A minimum is observed for u = 14.8 ps _1 . The parametrization described in the 
text is shown in (a) and (b); the parabolic fit to the three lowest points in (c). 
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Under these assumptions, the resolution function lZ(to,t) can be written as: 



K{t ,t) 



[Sf + {5 p tf} 




1 



exp ^ 
= exp 




{p-Po) 2 \ 1 



2[5f + (5 p ty]) 



2 a 2 J y^ ai 



exp 



(pet -p ct ) 2 \ pc 
2 (mai) 2 J m 



(6) 



where 5i = aim/(poc), 5 P = cr p /po. The approximation is valid if 5 P is significantly 
smaller than one, which is anyway required to assume Gaussian errors, since the recon- 
structed momentum cannot be negative. Furthermore, po is not accessible in real data; 
the reconstructed momentum is therefore used in the evaluation of the error from the 
decay length resolution: 5i ~ crim/(pc). 

A set of parameters is chosen here for the purpose of illustration. Resolution values of 
5 p = 0.15 and 5i = 0.14 ps are used; the latter one would correspond to a monochromatic 
sample of B s with p = 32 GeV/c and <j\ = 250 /mi. In a real analysis the normalization 
of the non-oscillating component is the total number N of b decays (differences in life- 
time are neglected), while the oscillation term is multiplied by N f Bs (1 — 2 77), / Bs being 
the fractions of B s in the sample and r\ the global mistag rate. For an inclusive analysis 
/b s (1 — 277) is typically about 0.05. The curves obtained with these parameters, normal- 
ization factors omitted, are shown in Fig. [3]. As the frequency increases, the oscillation 
amplitude is damped because of the resolution. For very large frequencies only the first 
period can be resolved. 

The fitting technique commonly used in the amplitude analysis is a simultaneous 
maximum-likelihood fit to the proper time distributions of mixed and unmixed events. 
Alternatively, the difference of the two distributions, i.e. the oscillating term, can be fit 
with a binned x 2 method. The two methods are discussed in the following. 

The maximum likelihood fit. Using the aforementioned formalism, the likelihood 
function can be written as: 



where again Am is the frequency of the oscillations in the sample analysed, and u is the 
value chosen in the fitting function. The minimization with respect to A leads to the 
condition 




[E{t) + f Am {t)} \og[E(t) + AfUt)] 

[E(t) - f Am (t)] log [E(t) - Af u (t)] + Const , 



+ 





which allows A to be determined. 
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Figure 3: Reconstructed proper time distributions for the non-oscillating component, 
E(t), and the oscillating component, fAm(t), at different values of Am. Resolutions of 
5 P = 0.15 and 5i = 0.14 ps are assumed. 
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The x 2 fit. Similarly, the \ 2 can be written as 



X 



the minimization of which gives 



°° d \fAm(t)-AUt)Y 

E(t) 



-J* W) = () 

Equations [7| and § both give A = 1 for u = Am. For uj ^ Am they are equivalent if 
Af u (t) is negligible compared to E{t). 

The expression of A& m (y}) can be derived from Eq. || as 



00 ^ fjfi UrrM 
E{t) 

-00 



(9) 



The resulting amplitude curves for Am = 5, 10, 15 ps _1 are shown in Fig. |]a. On top of 
the curves, values obtained from the likelihood fit (Eq. [?]) are also shown. The two fitting 
methods are indeed equivalent for u> ~ Am, as expected, while some difference appears 
for uj 7^ Am. 

The expected amplitude is unity at uj = Am. For uj > Am the behaviour depends on 
Am (for given resolutions). In this example, for Am = 15 ps -1 the expected amplitude 
increases monotonically. 

The expressions derived for the x 2 fit allow the expected error on the measured am- 
plitude to be also extracted, 



which in turn gives 



X 2 (A + a A )- X 2 (A) 
1 

a A (uj) = 



\ 



-00 E(t) 



(10) 



The significance of the measured amplitude is therefore: 

AAm(uj) 



00 d Ut)f Am (t) 

E(t) 



a 



Am 1 
A 1 



\ 



E(t) 



This latter equation is correct only because A and 04 are independent. 
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Figure 4: (a) Expected amplitude values for Am = 5, 10, 15, ps^ 1 . The curves refer to 
the x 2 minimization, the points to the likelihood fit. 

(b) Amplitude significance curves (x 2 fit). The expected shape of <j{uj) is also shown. 

(c) Expected shape of the likelihood, derived from the amplitude and its error. The 
dashed line corresponds to the the limit Am — > oo. 

Resolutions of S p = 0.15 and a t = 250 jim are assumed for the 3 plots. 
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The amplitude significance curves for Am = 5, 10, 15 ps -1 are shown in Fig. |]b. The 
normalization of the error, in the same figure, is arbitrarily chosen to have cr^ = 0.5 at 
L) = 15 ps -1 . 

The expected significance is maximal at uj = Am. For u > Am it decreases without 
reaching zero in the range explored. The decrease is more smooth for high values of Am. 

The expected shape of the likelihood, as calculated from the amplitude and its error 
using Eq. |J is shown in Fig. |c. 



3.1 Limits for small and large Am 

In the limit of very small or very large Am, some approximations can be made in the 
formulae, which yield simplified expressions of easier interpretation. 



Small Am. If Si <C 1/Am, S p /T <C 1/Am, the oscillation is slow and marginally 
affected by the resolution. This limit holds in the case of oscillations. If the resolution 
effects are neglected, Eq. |] can be rewritten as 







dt rexp(— Ft) cos tot cos Ami 



which gives 



/ dt T exp(-Tt) cos uut 
Jo 



r 2 r 2 
+ 



1 



r 2 + Au 2 

The resulting shape is shown in Fig. [|. The dots superimposed are obtained with toy 
experiments generated at the same value of the frequency, including resolution effects (for 
details on the simulation see Section |4j; the parameters used in the generation are those 
of samples S there defined). The two shapes are in qualitatively good agreement. 



Large Am. In this limit, which corresponds to the regime of B s oscillations, the res- 
olution effects dominate. If Am s rs 15 ps -1 and S p = 0.15, then S p /T ~ 0.23 ps, which 
is larger than 1/Am s rs 0.07 ps and therefore implies that only events with small proper 
time contribute to the sensitivity. Similarly, taking Si = 0.14 ps gives Si > 1/Am s , which 
implies a substantial damping of the amplitude of the oscillating term due to the decay 
length resolution. In this useful approximation is to assume that the term E(t) 

in Eq. ^ varies slowly compared to the fast oscillating term which is nonzero in a 

limited time range (Fig. |^), and take it out of the integral. In this way the expression 
can be simplified and rewritten in terms of the Fourier transformation of the oscillating 
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Figure 5: The full curve gives the expected shape of the amplitude for a signal at 
Am = 0.5 ps -1 when all resolution effects are neglected. The dots are obtained with a toy 
experiment in which resolution effects are simulated (where 5 P = 0.15 and 07 = 250 fj,m). 
The two shapes are in agreement. 



components, 



/oo poo _ 

dt fu(t) f Am {t) J du f„(u)f Am (i/) 

*A-A m {UJ) ~ °° r oo = °° roo • (12) 

dt flit) / du L (u) 



The approximation is valid only if both uj and Am are large. The functions f w are 
shown in Fig. |^a for a few different values of u > 10 ps -1 . Fig shows the product of 
two of these Fourier transformations to illustrate the behaviour of the ratio in Eq. |1^. 

With increasing Am, the frequency spectra, f w , become broader and smaller in ampli- 
tude. High true frequencies, Am, have their spectrum damped faster than low frequencies, 
and the peak at uj pa Am disappears for Am well beyond the sensitivity (Fig. |6|b). Due 
to the broadening of the spectra, the product f 0J1 (h / )i u)2 (i / ) is peaked around the smallest 
between oj\ and uo 2 (Fig. ^|c). This fact implies that when a sample with oscillations at 
frequency Am is analysed with a function containing a frequency uj < Am, the measured 
amplitude is dominated by the frequencies around uj; therefore the shape of A& m {uj) for 
uj < Am resembles that of f&. m {oj). For uj > Am the frequencies around Am are al- 
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Figure 6: (a) Expected shapes of the Fourier spectra f& m for different values of Am. The 
spectra become broader and lower in amplitude when Am increases. 

(b) Detail of the spectra for high Am. 

(c) Products of pairs of Fourier spectra. The resulting functions are peaked around the 
smallest of the two frequency values. 

Resolutions of 5 P = 0.15 and 07 = 250 /im are assumed. 
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ways tested, with a normalization factor which increases with u (Eq. O, the denominator 
decreases very fast), therefore A^ m {oj) increases monotonically. 



In order to understand better the effect of the decay length and proper time resolution, 
it is useful to study them separately. Setting 5 P = in Eq. || the following simplified 
expression can be obtained, 



f. 



1 

2 



r 2 



+ 



r 2 + (w + Am) 2 r 2 + (w - Am) 2 



exp 



which shows that the decay length resolution is responsible for the damping of the high 
frequencies. 

Considering the momentum resolution alone, the following expression is obtained, 



oo I 

dv - 

-oo 2 



r 2 



+ 



r 2 + [y + Am) 2 r 2 + [y - Am) 



exp 



which shows that the momentum resolution causes the broadening of the frequency spec- 
trum (as intuitively expected, since a shift in the reconstructed momentum is equivalent 
to a change of scale on the time axis). For Am = 15 ps -1 and 8 P = 0.15, the width of the 
frequency spectrum is dominated by the momentum resolution. 

A broader frequency spectrum corresponds to a broader structure in the amplitude 
spectrum, or, equivalently, to higher correlations between values of the amplitude mea- 
sured at different frequencies. This property is relevant for the confidence level estimation 
as explained in Section 



3.2 Fluctuations 

The expected shape of the likelihood for a sample with oscillations at a frequency far be- 
yond the sensitivity is shown in Fig. f|c. In a given frequency range, statistical fluctuations 
of the likelihood can produce values below which can fake a signal. The probability of 
observing that AC is lower than a given value AC at a given frequency u can be estimated 
from Eq. |], using the fact that the errors on the measured amplitudes are found to be 
Gaussian with high precision: 



V(AC,uj) = V(AC<AC) UJ = -erfc 

The function V(AC, u) is shown in Fig [j]a, where the same parameters and normaliza- 
tion as for Fig. |] are used. This function can be used as an estimator of the signal-ness 
of a given sample. Estimator contours, equidistant on a logarithmic scale, are drawn in 
Fig. [7|b. Small negative values are most probable at high frequencies, while for larger 
negative values the maximum of the probability is found at lower frequencies. 



-AC a A {uS) + 



1 



V2 



(13) 
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Figure 7: (a) Probability to find a AC value lower than AC at a frequency u. 

(b) Contours of equal probability in the (AC,u) plane. Values between —3 and —5 are 

found with higher probability when the error on the amplitude is from 0.5 (at ou — 15 ps _1 

in this example) to 0.3 (cu ~ 11 ps _1 ). The contours shown are equidistant on a logarithmic 

scale. 

Resolutions of 5 P = 0.15 and 5[ = 0.14ps are assumed. 
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4 The toy experiments 



The probability that the minimum observed in the likelihood (Fig. ^) is caused by a fluc- 
tuation can be evaluated by means of toy experiments with the above estimator (Eq. |r5| ). 

In a general case, the depth of the likelihood minimum can be translated to a statistical 
significance in the approximation that the likelihood is parabolic, which is not the case 
here. 

At each frequency point, the probability that the measured AC is lower than a given 
value AC can be calculated as explained in Section [T2], starting from the errors on the 
measured amplitude. This procedure cannot be applied to the minimum, since A£ min is 
not an "unbiased" value, but it is chosen as the lowest value found over a certain frequency 
range explored. 

The sum of the probabilities of obtaining a likelihood value lower than observed at 
all points where the amplitude is measured does not provide a good estimate either. The 
different points are highly correlated and they cannot fluctuate independently, therefore 
the sum of the individual probabilities would give a gross overestimate of the overall 
probability of finding a minimum as or more unlikely than the one observed. 

The only viable possibility is to calibrate the significance of the structure observed 
with the help of toy experiments. The worldwide combination includes many analyses, 
and a detailed simulation of each of them is highly impractical. The procedure adopted 
here is to choose a set of parameters for the generation of the toy experiments such that 
each experiment is equivalent to the world average. The set of parameters cannot be 
uniquely determined from the data: it turns out that some parameters need to be fixed 
a priori, and therefore the dependence of the result obtained upon the particular choice 
adopted needs to be understood. The possible effects of the lack of a detailed simulation 
are investigated in Section by studying the dependence of the correlations in the 
amplitude measurements upon the parameters chosen to generate the toy experiments. 

4.1 Generation 

The basic features of the toy experiments used to estimate the significance of the likelihood 
minimum can be summarized as follows. 

• Bottom hadron species are generated according to a chosen composition. 

• For each species, the true proper time to of each b hadron is generated according 
to an exponential with decay constant equal to its width, T, multiplied by a given 
efficiency function. 

• Neutral B mesons are allowed to mix. Mixed and unmixed particles have their 
proper time distributions modified by the appropriate oscillating term, with given 
frequency. 
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• The true momentum po is generated according to a Peterson distribution, tuned to 
reproduce a given mean scaled energy (rc 6 ). 

• The true decay length is then obtained, for each b hadron, from 

*o = c. 

m 

• A smearing is applied to the true decay length and momentum according to given 
resolution functions, to obtain the measured decay length and momentum, I and p. 

• The measured proper time is hence calculated as 

Im 
pc 

• A mixed/unmixed tag is assigned to the generated hadrons using specified mistag 
rates. 

• The udsc background is neglected. 

4.2 The choice of the parameters 

The only information, at the level of the world combination, which can drive the choice of 
the parameters for the simulation is provided by the errors on the measured amplitudes. 
The step at uj = 15 ps -1 is due to some analyses in which the scan was not performed 
beyond that value of the frequency. The step at uj = 19 ps _1 is due to the SLD analyses, 
for which no measurement was provided for uj > 19 ps -1 . In all what follows the four 
points with uj > 19 ps -1 are ignored, in order to reduce the pathologies in the error shape. 
The errors on the measured amplitudes can be formally written as (see also Eq. [10]) 



g a \uj) = y/Nf BB (l-2r } )X(5 p ,<T l ,u). (14) 

The factor k = yN /b s (1 — 2 77) gives the normalization of the error distribution, without 
affecting the shape, and obviously the three parameters can not be disentangled. It is 
chosen to fix /b s and 77 to some "typical" values (namely /b s = 0.15, 77 = 0.25), and adjust 
iV to fit the data errors. The effect of a different choice which yields the same k value is 



investigated in Section 5.1 



The decay length and momentum resolution terms both affect the shape of the mea- 
sured error as a function of uj. The sensitivity is not enough to get a reliable simultaneous 
determination of both. It is thus chosen to fix 5 p , again, to a "typical" value of 5 P = 0.15, 
and tune the value of o\. This choice is preferred because, as explained later, 5 P plays 
an important role in the determination of the confidence level, and needs anyway to be 
varied over a wide range to check the stability of the result obtained. 
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Figure 8: (a) Optimization of the number of events for three different values of the decay 
length resolution. 

(b) Choice of the optimal decay length resolution. 



Samples are generated at three starting points for 07, which are chosen to be 200 /im, 
250 /im and 300 fim, each with 30000 events and the other parameters as described above, 



and reported in more detail in Section 4.3 



For each value of o~i, the number of events is tuned by minimizing the sum of the 
differences with the data errors, 

E/^data toy\ 2 

i 

where the scaling low of Eq. [14] is used (Fig. |8|a). The three minima found are then 
compared and interpolated with a parabolic fit (Fig. |8|b) to find the absolute minimum, 
which turns out to be very close to 250 /im. The number of events needed at this point is 
34000. 

4.3 Samples description 

On the basis of the procedure described in Section P~2|, a set of parameters S is defined 



as follows: 

• a single purity class: 15% B s 38% B d 38% B+ 9%A b ; 

• a single tagging class: mistag rate 77 = 25% for all species; 

• a single resolution class: 

a 1 = 250 am | , ;1 „, . . ;1 

, „ _ > both Gaussian with no tails ; 
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Figure 9: Shape of the reconstruction efficiency as a function of true proper time. The 
normalisation of the vertical scale is arbitrary. 



• Monte Carlo parametrized efficiency (taken from the analysis of Ref. ||). The curve 
is shown in Fig. [| 

• b hadron masses and lifetimes, and Amj from Ref. |J; 

• (x b ) = 0.7; 

• Am s fixed at different values, according to the study considered; 

• statistics of 34000 b hadron decays. 

A second set of parameters S' is defined to generate a second family of toy experiments. 
The momentum resolution is chosen to be <J p /po = 0.07, which is significantly better than 
what is typically achieved in inclusive analyses. In order to keep the agreement with the 
world average errors on the measured amplitudes, the number of events is reduced to 



29000 (obtained with the procedure described in Section |4.2|) . The other parameters are 
left unchanged. These experiments are used in the following to investigate the dependence 
of the confidence level upon the momentum resolution. 

The errors on the amplitude, cr^, obtained with these two sets of experiments are 
compared to the errors from the combined data in Fig. [nj. The step at u — 15 ps~ x could 
be reproduced by averaging, for each "experiment" , two "analyses" , of which one has its 
scan stopped at that point. No attempt was made in this direction. 

A third set of samples S" with 5 P = 0.15, 07 = 200 /im and statistics of 16500 decays 
(which correspond to the optimization of Fig. §) is used to investigate the dependence of 
the confidence level upon the decay length resolution. 
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Figure 10: Amplitude errors comparison: simulated experiments versus world average 
data. 



Finally samples of type s are defined from samples S by increasing /b s by a factor of 
five (i.e. having /b s = 0.75) and reducing the statistics by a factor of 25 (which gives 1360 
decays) . 

In Fig. 11 the expected shape of the amplitude and the likelihood is shown, as obtained 
by averaging 1000 samples of type S, generated with Am s = 150 ps -1 . The expected value 
is consistently zero, and the errors on A are Gaussian, which confirms the validity of the 
amplitude method to set limits on the oscillation frequency. 



5 The estimate of the significance 



As demonstrated in Section 3.2, the probability that, at a given point in the frequency 



scan, a value of the likelihood AC < AC be found, can be calculated, given AC, from the 
error on the measured amplitude, which is available from the data. 

For the purpose of establishing the significance of the minimum, however, this proba- 
bility is not enough, since what is needed is the probability that anywhere in the range 
explored a configuration more unlikely than the one observed may appear (in the hypoth- 
esis of large Am s ). This significance is driven not only by the errors, but also by the 
correlations between the amplitude measurements at different frequencies, which are not 
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Figure 11: (a) Expected amplitude and error for samples of type S, with Am s = 15ps _1 , 
as a function of uj. 

(b) Expected likelihood shape. The plots are obtained by averaging 2000 samples. Reso- 
lutions of S p = 0.15 and o\ = 250 fim are assumed. 

controlled from the data, and might depend on the particular combination of parameters 
chosen for the simulation. It is therefore mandatory to identify the most relevant sources 
of systematic uncertainty which might affect the extraction of the confidence level. This 
point is investigated in the next section. 

5.1 Correlations 

From the discussion of Section [J[ it turns out that the momentum resolution is the most 
critical parameter to determine the point-to-point correlation in the amplitude scan. In 
a sample with better momentum resolution, correlations are smaller and therefore the 
probability of having significant deviations from A = in a sample with no signal is 
larger, in a given frequency range explored. 

In order to investigate the dependence of the point-to-point correlation upon the pa- 
rameters used in the generation, a sensitive quantity is the average difference between 



20 



amplitudes measured at two given points in the frequency scan. If there were no correla- 
tions, this difference could be written in terms of the errors on the amplitude as 

( | >Aj | ) 

Correlations reduce this value if i and j are close enough. A scan in steps of 0.25 ps _1 is 
assumed, as for the data analyses. 




For each of the four set of parameters, S, S', S" and s defined in Section [0|, 150 
samples are produced, and the quantity ( | A% — Aj | ) is calculated, for i—j = 1, 4, 7, 10. 
The results are shown in Fig. where they are compared with the expectation for no 
point-to-point correlation. 

Compared to the most "realistic" samples, S, the largest deviation is observed, as 
expected, when the momentum resolution is changed (samples S'). At low u, the difference 
between the no-correlation limit (curve) and the values found in the simulation (markers), 
decreases rapidly as the distance between the points increases: for i — j = 4 Au = 
lps -1 ) it is reduced by about a factor of two compared to i — j = 1, so Au = 1 ps _1 
can be taken as an estimate of the "correlation length" at small frequencies. When u 
increases, the difference between the curve and the simulation remains substantial even 
when the points are a few ps _1 apart, demonstrating the increase of the correlation length 
with ui. 

Samples S' can be used to estimate a "systematic uncertainty" on the confidence level 
obtained, coming from the specific choice of the parameters used in the simulation. 



5.2 The Confidence Level 

The significance of the minimum observed in the AC distribution (Fig. ^) is estimated by 
computing the probability that a structure as or more unlikely is observed in a sample 
with Am s far beyond the sensitivity. 

In order to do that, it is taken into account that the probability of observing a given 
value of AC is a non-trivial function of uj. Probability contours in the (AC,u) plane (as 
in Fig. 0b) are built from the data errors. The contour corresponding to the data sample 
is computed. iV ex p = 2000 samples of type S with Am s = 150 ps~ x are analysed and the 
number of those that give a minimum A£ min < outside the contour corresponding 
to the data is recorded. Since the expected value of the likelihood is positive for all 
frequencies (see Fig. |Tl|b), occasionally the minimum in the range — 19 ps^ 1 is also 
positive. These minima are not counted, independently of the frequency at which they 
occur, since thay can not be interpreted cLS cL SI gnal of oscillations. 

The population of the toy experiments in the (AC, u) plane along with the point 
corresponding to the data sample, is shown in Fig. [13]. 

The confidence level is computed as 

N out 

1-C.L. = = 0.021 ±0.003. (15) 
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Figure 12: Point-to-point fluctuations for four sets of samples. From top to bottom, 
the average difference between points distant 1-4-7-10 steps in the amplitude scan are 
shown. The lines correspond to the limit of no correlation between points. 
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Figure 13: Minima of AC for 2000 samples of type S, with Aro s = 150 ps -1 . The curves 
represent contours of equal probability of observing a value of AC smaller than AC, as a 
function of u (as in Fig. 0). 

The study is repeated with 2000 samples of type S', and yields 

1-C.L. = 0.033 ±0.004. (16) 

This value has to be understood as a conservative estimate of the probability of statis- 
tical fluctuations, since it is obtained with experiments built to have lower point-to-point 
correlations than that expected for the average of real analyses. The distribution of the 
minima for this case is shown in Fig. |14|. The difference between the values of Eq |1^ 
and Eq. [16| gives an upper limit for the uncertainty coming from the lack of a detailed 
simulation. 

The probability that the current result of the world combination of B s oscillation 
analyses is due to a statistical fluctuation can be therefore quantified to be around 3%. 
The uncertainty on this number coming from the inaccuracies of the simulation is below 
1%. 
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Figure 14: Minima of AC for 2000 samples of type S' with Am s = 150 ps 1 . The same 
curves as in Fig [13] are shown. 



5.3 Comparison with the oscillation hypothesis 

In order to check that the amplitude spectrum observed in the data is in qualitative 
agreement with the hypothesis of oscillations, 500 samples of type S have been produced, 
with Am s = 14.8 ps" 1 . The expected amplitude and error at each frequency value are 
shown in Fig. [15|, with the data points superimposed. The agreement is good over the 
whole frequency range. 

A quantitative study of the compatibility of the data with the signal hypothesis would 
require to perform a fine scan on Am s with many samples at each value, in order to define 
a probability that the results observed are produced by an oscillation with a frequency in 
the range explored. This kind of study is not attempted here. 

A simple check is performed instead. The 500 samples with oscillation at a value 
Am s = 14.8 ps" 1 are analysed in terms of their incompatibility with the no-oscillation 
hypothesis. The scatter plot of the likelihood minima in the (AC, uj) plane, as for the 
samples with Am s = 150 ps" 1 , is presented in Fig |i~6|. 

An enhanced density in the region 14 ps -1 < u < 16 ps" 1 , —3 < A£ mm < —1 is shown 
in the plot. A cluster of experiments with minima at uo = 19 ps" 1 is also clearly visible: 
for these experiments the lowest point of the likelihood was at the boundary of the region 
analysed. Experiments with A£ m j n < — 5 appear at frequencies lower than the true one, 
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Figure 15: Average amplitude and expected error as a function of 00 for a signal at 
Am s = 14.8 ps _1 . The amplitude values, obtained by averaging 500 toy experiments, are 
in good agreement with the data measurements (solid points). 



Figure 16: Minima of AC for 500 samples of type S, with Am s = 14.8 ps 1 . The same 



curves as in Fig [13] are shown. 
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where fluctuations which can produce deep minima are more likely. 

Out of these 500 samples, 80 were found outside the estimator contour corresponding 
to the data, which gives a probability of 16%. If the data results were perfectly "typical" 
compared to the toy samples, the expected result would be 50%. 

6 Conclusion 

The likelihood profile as a function of uj derived from the combined amplitude mea- 
surements available at the time of the 1999 Winter Conferences shows a minimum at 
uj = 14.8 ps -1 . The depth of the minimum compared to the asymptotic value for uj — > oo 
is A£ min = —2.9. The intervals at which the A£ min + 1/2 and A£ min + 2 levels are 
crossed are: 

lu = U.8 ±0.5 ps" 1 (A£ min + l/2 interval), 
uj = 14.8 tli ps _1 (A£ min + 2 interval) . 

The significance of the minimum observed in the likelihood cannot be calculated an- 
alytically, but needs to be calibrated using toy experiments. With the method proposed 
here, the probability that the result observed is produced by a statistical fluctuation in a 
sample with no signal is found to be: 

1-C.L. « 3%. 

The uncertainty on this estimate coming from the lack of a detailed simulation of the 
individual analyses contributing to the average is estimated to be below 1 %. 
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